We show several results related to interactive proof modes of communication complexity. First
we show lower bounds for the QMA-communication complexity of the functions Inner Product
and Disjointness. We describe a general method to prove lower bounds for QMA-communication
complexity, and show how one can 'transfer' hardness under an analogous measure in the query
complexity model to the communication model using Sherstov's pattern matrix method. Combining
a result by Vereshchagin and the pattern matrix method we find a communication problem
with AM-communication complexity $O(\log n)$, PP-communication complexity $\Omega(n^{1/3})$, and
QMA-communication complexity $\Omega(n^{1/6})$. Hence in the world of communication complexity
noninteractive quantum proof systems are not able to efficiently simulate co-nondeterminism or interaction.
These results imply that the related questions in Turing machine complexity theory cannot be
resolved by 'algebrizing' techniques. Finally we show that in MA-protocols there is an exponential
gap between one-way protocols and two-way protocols (this refers to the interaction between Alice
and Bob). This is in contrast to nondeterministic, AM-, and QMA-protocols, where one-way
communication is essentially optimal.

1 Introduction 



In their seminal 1986 paper on 'Complexity Classes in Communication Complexity' [BFS86], Babai et
al. define, among a host of other classes, the communication complexity analogues of the interactive
proof classes AM and MA. In this context, for instance, the class MA consists of all communication
problems that have MA-communication complexity at most $\mathrm{polylog}\, n$. The MA-communication
complexity is the optimal complexity of a protocol solving a communication problem with the help
of a prover Merlin and two verifiers Alice and Bob. Alice and Bob each see only their part of the
input, while Merlin sees the whole input; however, Merlin cannot be trusted. Merlin sends a proof,
followed by a discussion between Alice and Bob. See Section 2.1 for definitions.

While the Turing machine versions of AM and MA (capturing interactive proof systems with a
constant number of rounds between the prover and verifier resp. noninteractive, but randomized proof
systems [B85, BM88]) played a crucial role in the subsequent development of theoretical computer
science, their communication complexity analogues were probably considered too esoteric a topic to
merit much consideration. One of the few results about them is a 2003 lower bound of $\Omega(\sqrt{n})$ for the
MA-communication complexity of the Disjointness problem Disj by this author [K03] (Disj is co-NP
complete in the world of communication complexity). The same paper also relates these complexity
measures to the power of the rectangle (aka corruption) bound in communication complexity. In
2004, Raz and Shpilka [RS04] exhibited a problem whose quantum communication
complexity is exponentially smaller than its MA-communication complexity, providing another
MA-communication complexity lower bound, as well as a problem whose Quantum MA (short
QMA-) communication complexity is exponentially smaller than both its quantum (without prover)
and its MA-communication complexities. Raz and Shpilka left proving lower bounds for the
QMA-communication complexity of any function as an open problem.

Surprisingly, in 2008 Aaronson and Wigderson [AW09] not only showed that the mentioned lower
bound for Disj is basically tight, but also gave a new incentive to understand the relations between
the complexity classes in the world of communication complexity. Their paper investigates a new
'barrier' in complexity called algebrization. Without going deeper into this topic, one of their results is
that a separation between two communication complexity classes shows that the algebrization barrier
applies to their Turing machine analogues, i.e., an attempt to show that the two classes are the same
will encounter said barrier.

A restricted model of MA (and QMA) one-way communication complexity has been investigated
by Aaronson [A06]. Very recently Gavinsky and Sherstov [GS10] have shown that co-NP is not a
subset of MA in the model of multiparty communication complexity.

One motivation for investigating proof systems in communication complexity is to understand
the power of proofs. The results in [RS04] show that in a setup where we can actually prove such
statements, quantum proofs (of type QMA) are more powerful than classical proofs. However, many
questions about this topic remained open: is AM larger than MA? Is co-NP a subset of QMA? Is
QMA a subset of AM? In this paper we resolve some of these questions.

In our first result we show a lower bound of $\Omega(n^{1/3})$ for the QMA-communication complexity of
Disj. This means that co-NP is in fact not a subset of QMA in the world of communication complexity,
and that trying to put co-NP into QMA in the 'real world' (besides the inclusion probably being
false) would require a nonalgebrizing technique.

We also show how a lower bound method we call 'one-sided discrepancy' gives lower bounds for
QMA-communication complexity.^1 Furthermore we show a (basically tight) lower bound of $\Omega(\sqrt{n})$
for the QMA-communication complexity of the function IP2, the function that computes the inner
product modulo 2. It is an interesting question whether the bound for Disj is tight, because we have
two very different upper bounds of the order $O(\sqrt{n})$ for this problem: the MA-protocol from [AW09],
and the Grover based quantum protocol from e.g. [AA03]. Is it possible to combine these protocols
in some way to get a more efficient QMA protocol?

Our second main result shows that there is a partial function $f$ for which the weakly unbounded
error communication complexity $PP(f)$ is $\Omega(n^{1/3})$, whereas the AM-communication complexity is
only $O(\log n)$. The PP lower bound immediately implies an $\Omega(n^{1/6})$ lower bound for the
QMA-communication complexity of the same problem. Hence here the tiniest amount of interaction in
classical proof systems (the verifiers may challenge the prover with a public coin message) cannot
be simulated by a noninteractive proof system even with the help of quantumness. In terms of
algebrization this result shows that putting AM into PP (or even into MA) needs nonalgebrizing
techniques. On the other hand it is widely believed that AM=NP because it might be possible to
derandomize AM.

The result has another implication: since PP-complexity coincides with the discrepancy bound
[K07], it turns out to be possible for a function to have polynomial size rectangle covers of its 1-inputs
with small error under every distribution on the inputs, yet for some distribution on the inputs every
individual rectangle is either exponentially small, or has error exponentially close to 1/2. That means
any attempt to prove lower bounds for AM-protocols by considering the properties (error and size)
of individual rectangles alone must fail. We believe that it is important to show lower bounds for
AM-protocols, because new techniques developed for this problem need to get past this 'rectangle
barrier'. Furthermore such a proof would be a first step towards resolving the $\Pi_2 \neq \Sigma_2$ problem in
communication complexity, one of the biggest problems left open by [BFS86].

^1 Essentially the same method has been used by Gavinsky and Sherstov for the multiparty version of
MA-communication complexity [GS10].

Finally, our third result considers the structure of MA-protocols. In general, nondeterministic
protocols require no nontrivial interaction between Alice and Bob, i.e., after seeing the proof, Alice can
just send one message to Bob, who accepts or rejects. The same is trivially true for AM-protocols
(with our definition). Interestingly, and somewhat counterintuitively, Raz and Shpilka show that
(within a polynomial increase in communication) one-way communication is also optimal for
QMA-protocols. We show that there is a problem for which one-way MA-communication is exponentially
worse than two-way randomized communication. This highlights the difference between quantum
and classical proofs, and is somewhat reminiscent of the fact that in the 'real world', quantum
proof systems in the class QIP can be parallelized to only 3 rounds [KW00], whereas a similar
parallelization of classical proofs would collapse the polynomial hierarchy (here parallelization refers
to the interaction between the prover and the verifier).

2 Definitions and Preliminaries 

2.1 Arthur Merlin Communication Complexity Definitions 

For definitions of more standard modes of communication complexity we refer to Kushilevitz and 
Nisan's excellent monograph [KN97]. 

In Arthur Merlin communication games, there are 3 parties: Merlin, Alice, and Bob. All of them are
computationally unbounded. Alice sees her input $x \in \{0,1\}^n$, Bob his input $y \in \{0,1\}^n$, and Merlin
sees both inputs. Merlin is the prover, who wants to convince the verifier, consisting of Alice and
Bob together, that $f(x,y) = 1$.

Definition 1 In a Merlin-Arthur protocol (short MA-protocol) for a Boolean function $f$, Alice
initially receives a message (also called the proof) from Merlin. After this Alice and Bob communicate
until they compute an output, using public coin randomness (the proof cannot depend on the
randomness). The cost of an MA-protocol is the sum of the length $a$ of the proof and the length $c$ of the
overall communication between Alice and Bob. The protocol computes $f$ if for all inputs $x,y$ with
$f(x,y) = 1$ there exists a proof such that $x,y$ is accepted with probability at least $p$, and for all inputs $x,y$
with $f(x,y) = 0$ and all proofs the probability that $x,y$ is accepted is at most $q$. Here $p$ must be at least a
constant factor larger than $q$; $p$ is the completeness and $q$ the soundness of the protocol. We will call
$\max\{1 - p, q\}$ the error of the protocol, and frequently consider protocols with very small error. If not
mentioned otherwise we assume $p = 2/3$ and $q = 1/3$.

The Merlin-Arthur complexity of $f$, denoted $MA(f)$, is the smallest cost of an MA-protocol for
$f$. The MA-complexity with bounded proof length $a$ is denoted $MA^{(a)}(f)$.

Note that the error probabilities (resp. the soundness and completeness) of MA-protocols can be 
improved arbitrarily by using standard boosting techniques. For this the proof itself does not need 
to be repeated, so the proof length is not increased. 
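The effect of standard boosting can be checked numerically. The sketch below assumes the usual scheme (our assumption, the text does not spell it out): Alice and Bob reuse the same proof in $k$ independent runs with fresh public coins and take a majority vote. The function name `majority_error` is ours.

```python
from math import comb

def majority_error(p_correct: float, k: int) -> float:
    """Probability that a majority vote over k independent runs is wrong,
    when every single run is correct with probability p_correct.
    k is assumed to be odd so that no ties occur."""
    if k % 2 == 0:
        raise ValueError("k must be odd")
    # The vote is wrong exactly when at most (k - 1) / 2 runs are correct.
    return sum(
        comb(k, j) * p_correct**j * (1 - p_correct) ** (k - j)
        for j in range((k - 1) // 2 + 1)
    )

if __name__ == "__main__":
    for k in (1, 3, 11, 51):
        print(k, majority_error(2 / 3, k))
```

The error shrinks exponentially in $k$, while the proof is sent only once, so only the communication between Alice and Bob grows.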

Also note that including the proof length in the cost is crucial, because otherwise Merlin could
provide Alice with a copy of $y$, whose correctness could be checked with a standard fingerprinting
protocol, decreasing the complexity of all functions to $O(1)$ (due to public coins being available).
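A minimal sketch of such a fingerprinting check, assuming the common random-parity variant (the text does not fix a particular fingerprinting scheme, and the function names are ours): for each public random string $r$, Bob sends the parity $\langle r, y \rangle \bmod 2$ and Alice compares it with the parity of the copy she received from Merlin.

```python
import random

def parity_fingerprint(s, r):
    """Inner product of the bit lists s and r modulo 2."""
    return sum(a & b for a, b in zip(s, r)) % 2

def equality_check(y_claimed, y, k, rng):
    """Alice holds y_claimed (Merlin's alleged copy of y), Bob holds y.
    They use k public random strings; Bob sends one bit per string.
    Equal strings always pass; unequal strings pass with probability 2**-k."""
    n = len(y)
    for _ in range(k):
        r = [rng.randrange(2) for _ in range(n)]  # public coins, seen by both
        if parity_fingerprint(y_claimed, r) != parity_fingerprint(y, r):
            return False
    return True

if __name__ == "__main__":
    y = [1, 0, 1, 1, 0, 0, 1, 0]
    print(equality_check(list(y), y, 20, random.Random(0)))
```

The communication between Alice and Bob is $k$ bits, independent of $n$, which is why the proof length has to be charged.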

Definition 2 In an Arthur Merlin (short AM-) protocol, Merlin, Alice, and Bob share a source of
random bits. First a random challenge (of a predefined length) is drawn from this source. Merlin then
produces a message (called the proof), which is sent to Alice. After this Alice and Bob communicate
until they either accept or reject. They may not use fresh random bits at this stage, i.e., all random
bits are known to Merlin. The cost of an AM-protocol is the sum of the length of the proof and the
length of the communication between Alice and Bob.

The protocol computes $f$ if for all inputs $x,y$ with $f(x,y) = 1$, with probability at least 2/3 there
exists a proof such that $x,y$ is accepted, and for all inputs $x,y$ with $f(x,y) = 0$, with probability at
most 1/3 there exists a proof such that $x,y$ is accepted. The Arthur Merlin complexity of $f$, denoted
$AM(f)$, is the smallest cost of an AM-protocol for $f$.

Note that a more generous definition is possible, in which Alice and Bob still have access to private
random bits after receiving the proof. We prefer to call such protocols AMA-protocols, because our
definition of AM-protocols is combinatorially cleaner and strong enough for our separation result.
Note that by standard techniques from the theory of Arthur Merlin games [B85, BM88] both MA-
and AMA-protocols can be at most quadratically cheaper than AM-protocols, while we later show
that AM-protocols can indeed be exponentially more efficient than MA-protocols.

Also note that AM-protocols need only one round of communication between Alice and Bob: to
simulate any more complex protocol, Merlin can include the whole conversation between Alice and
Bob in his proof. Alice and Bob then only need to check that their respective parts are represented correctly.

We now define a quantum version of MA-protocols. 

Definition 3 In a quantum Merlin-Arthur (short QMA-) protocol, Merlin produces a quantum state
$\rho$ (the proof) on some $a$ qubits, which he sends to Alice. Alice and Bob then communicate using a
quantum protocol, and either accept or reject the inputs $x,y$. We say that a QMA-protocol computes
a Boolean function $f$ if for all inputs $x,y$ with $f(x,y) = 1$ there exists a (quantum) proof such
that the protocol accepts with probability at least $p$, and for all inputs $x,y$ with $f(x,y) = 0$ and all
(quantum) proofs the protocol accepts with probability at most $q$. Again, we require $p$ to be at least a
constant factor larger than $q$, and we set them to 2/3 resp. 1/3 if not mentioned otherwise. The cost of a
QMA-protocol is the sum of $a$ and the length of the communication between Alice and Bob. The cost of the
cheapest protocol that computes $f$ defines $QMA(f)$. The QMA-communication complexity with bounded
proof length $a$ is denoted by $QMA^{(a)}(f)$.

Let us first note that, surprisingly, the error probability of QMA-protocols can be reduced without
repeating the quantum proof, due to a clever procedure introduced by Marriott and Watrous [MW05]
in the context of standard QMA-games. Since the proof of [MW05] uses the verifier simply as a black
box, their construction carries over to the communication complexity scenario. Note however, that
this boosting technique increases the number of rounds between Alice and Bob, because their message
sequences are computed and uncomputed in a sequential manner.

Fact 1 If there is a QMA-protocol with proof length $a$, communication $c$ and error 1/3, then there
is a QMA-protocol with proof length $a$, communication $O(c \cdot k)$ and error $1/2^k$.

We now turn to protocols with (weakly) unbounded error. 

Definition 4 In a weakly unbounded error protocol Alice and Bob have access to a private source of
random bits each. The protocol computes a Boolean function $f$ if for all inputs $x,y$ the probability $p_{x,y}$
of computing the correct output $f(x,y)$ exceeds 1/2. The gap on input $x,y$ is $g_{x,y} = p_{x,y} - 1/2$, and the
gap of the protocol is $g = \min_{x,y} g_{x,y}$. The cost of a weakly unbounded error protocol with worst case
communication $c$ is $c - \log g$, and the weakly unbounded error complexity of a function $f$ is $PP(f)$, the
minimum cost of any protocol that computes $f$ in the described manner.
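As a small worked instance of this cost measure, the following sketch evaluates $c - \log g$ for given parameters. Taking the logarithm to base 2 is our assumption (the definition writes only $\log$), and the function name is ours.

```python
from math import log2

def weakly_unbounded_cost(c: int, gap: float) -> float:
    """Cost c - log2(gap) of a protocol with worst-case communication c
    and gap in (0, 1/2]; base-2 logarithm is an assumption."""
    if not 0 < gap <= 0.5:
        raise ValueError("gap must lie in (0, 1/2]")
    return c - log2(gap)

if __name__ == "__main__":
    # 3 bits of communication with gap 2**-5 cost 3 + 5 = 8.
    print(weakly_unbounded_cost(3, 2**-5))
```

A protocol that communicates little but succeeds with probability only barely above 1/2 is therefore still expensive under this measure.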
There is another type of unbounded error protocols, in which the gap is not considered, but only
the communication necessary to achieve correctness probability exceeding 1/2. We do not consider
this model here. See e.g. [BFS86] and [RR10, S08a, BVdW07] for more.

2.2 Integer Polynomials 

In this section we consider the representation of Boolean functions by polynomials with integer 
coefficients. 

Let $f : \{0,1\}^n \to \{0,1\}$ be a Boolean function. A (partial) assignment $A : S \to \{0,1\}$ is an
assignment of values to some subset $S \subseteq \{x_1, \ldots, x_n\}$ of variables. We say that $A$ is consistent with
$x \in \{0,1\}^n$ if $x_i = A(i)$ for all $i \in S$. We write $x \in A$ as shorthand for '$A$ is consistent with $x$'. We
write $|A|$ to represent the cardinality of $S$ (not to be confused with the number of consistent inputs).
Furthermore we say that an index $i$ appears in $A$ iff $i \in S$, where $S$ is the subset of $[n]$ corresponding
to $A$. We define a function $\kappa_A : \{0,1\}^n \to \{0,1\}$ such that $\kappa_A(x) = 1$ iff $A$ is consistent with $x$.

Every $f : \{0,1\}^n \to \{0,1\}$ can be written as $\mathrm{sign}(\sum_{A : |A| \le d} w_A \cdot \kappa_A(x))$, where $d \le n$ is an
integer, the sum is over all partial assignments, and the $w_A$ are integers. We call the minimum of
$\sum_{A : |A| \le d} |w_A|$ that achieves this the threshold weight $W(f,d)$ of $f$ with degree $d$. If $d$ is too small to
allow representation of $f$, we set $W(f,d) = \infty$. We say that the integer polynomial
$\mathrm{sign}(\sum_{A : |A| \le d} w_A \cdot \kappa_A(x))$ sign-represents the function $f$.
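To make the definition concrete, the sketch below checks a sign-representation of the AND of two bits by partial assignments. It assumes the convention that the sum is positive on 1-inputs and negative on 0-inputs (the text leaves the convention implicit); indices are 0-based and all names are ours.

```python
from itertools import product

def kappa(assignment, x):
    """1 if the partial assignment (dict: index -> bit) is consistent with x, else 0."""
    return int(all(x[i] == b for i, b in assignment.items()))

def sign_represents(weights, f, n):
    """weights is a list of (assignment, integer weight) pairs.
    Assumed convention: the weighted sum is > 0 iff f(x) = 1 and < 0 iff f(x) = 0."""
    for x in product((0, 1), repeat=n):
        total = sum(w * kappa(A, x) for A, w in weights)
        if total == 0 or (total > 0) != (f(x) == 1):
            return False
    return True

def total_weight(weights):
    return sum(abs(w) for _, w in weights)

def degree(weights):
    return max(len(A) for A, _ in weights)

# AND(x0, x1) = sign(2 * [x0 = 1, x1 = 1] - [empty assignment])
and2 = lambda x: x[0] & x[1]
and2_rep = [({0: 1, 1: 1}, 2), ({}, -1)]

if __name__ == "__main__":
    print(sign_represents(and2_rep, and2, 2), total_weight(and2_rep), degree(and2_rep))
```

This representation has degree 2 and weight 3; it only shows that the threshold weight of AND at degree 2 is at most 3, not that 3 is optimal.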

Frequently in the literature (and importantly for us in Sherstov's paper [S08b]) the threshold
weight is defined not with partial assignments, but with characters $\chi_S$ of the Fourier transform over
the Boolean cube, i.e., parity functions on subsets $S$. Note that this changes the value of $W(f,d)$ by at
most a factor of $2^d$: in order to represent the function $\chi_S$ with weight $w_S$ we can assign weight $w_S$
to all partial assignments that fix all variables in $S$ such that the parity of the variables in $S$ is 1, and
$-w_S$ to all partial assignments that fix all variables in $S$ such that their parity is 0. Hence we get a
representation using partial assignments instead of the $\chi_S$ with total threshold weight increased by
a factor of at most $2^d$. Conversely, one can show how, given a partial assignment $A$ with weight $w_A$,
one can find a representation using a sum of characters $\chi_S$, so that the overall threshold weight is increased by
at most $2^d$. We omit this since it is not important for this paper.
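The first direction of this conversion can be verified mechanically. The sketch below assumes the sign convention of the paragraph above ($\chi_S(x) = +1$ when the parity of $x$ on $S$ is 1 and $-1$ otherwise); the helper names are ours.

```python
from itertools import product

def kappa(assignment, x):
    """1 if the partial assignment (dict: index -> bit) is consistent with x, else 0."""
    return int(all(x[i] == b for i, b in assignment.items()))

def character_as_assignments(S, w):
    """Rewrite w * chi_S as a weighted sum over all 2**|S| partial assignments
    that fix exactly the variables in S: weight +w for parity 1, -w for parity 0."""
    terms = []
    for bits in product((0, 1), repeat=len(S)):
        sign = 1 if sum(bits) % 2 == 1 else -1
        terms.append((dict(zip(S, bits)), sign * w))
    return terms

if __name__ == "__main__":
    terms = character_as_assignments((0, 2), 3)
    for x in product((0, 1), repeat=3):
        print(x, sum(wt * kappa(A, x) for A, wt in terms))
```

Exactly one of the $2^{|S|}$ assignments is consistent with any given $x$, so the sum reproduces $w \cdot \chi_S(x)$ while the total weight grows from $|w|$ to $2^{|S|} |w|$.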

2.3 Real Polynomials 

In Section 3 we will also use the representation of Boolean functions by polynomials with real
coefficients. For definitions concerning this topic we refer to [BdW02].

2.4 Pattern Matrices 

In [S08b] Sherstov introduced a method to turn Boolean functions $f : \{0,1\}^n \to \{0,1\}$ into
communication problems that are hard whenever $f$ is hard under certain measures of complexity. Here we
define pattern matrices.

Definition 5 For a function $f : \{0,1\}^n \to \{0,1\}$ the pattern matrix $P_f$ is the communication matrix
of the following problem: Alice receives a bit string $x$ of length $2n$, Bob receives two bit strings $y, z$
of length $n$ each. The output of the function described by $P_f$ on inputs $x,y,z$ is $f(x(y) \oplus z)$, where
$\oplus$ is the bitwise xor, and $x(y)$ denotes the $n$-bit string that contains $x_{2i - y_i}$ in position $i = 1, \ldots, n$.
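A small sketch of this construction, following the definition literally: position $i$ (1-based) of $x(y)$ is $x_{2i - y_i}$, which becomes index `2*i + 1 - y[i]` with 0-based indexing. The function names and the ordering of rows and columns are our own choices.

```python
from itertools import product

def select(x, y):
    """x(y): for each 0-based position i pick x[2*i + 1 - y[i]],
    i.e. the 1-based entry x_{2(i+1) - y_i} of the definition."""
    return tuple(x[2 * i + 1 - y[i]] for i in range(len(y)))

def pattern_entry(f, x, y, z):
    """Entry of the pattern matrix: f(x(y) XOR z)."""
    return f(tuple(a ^ b for a, b in zip(select(x, y), z)))

def pattern_matrix(f, n):
    """Rows are Alice's inputs x in {0,1}^(2n); columns are Bob's inputs (y, z)."""
    rows = list(product((0, 1), repeat=2 * n))
    cols = [(y, z) for y in product((0, 1), repeat=n) for z in product((0, 1), repeat=n)]
    return [[pattern_entry(f, x, y, z) for (y, z) in cols] for x in rows]

if __name__ == "__main__":
    for row in pattern_matrix(lambda v: v[0], 1):
        print(row)
```

The matrix has $2^{2n}$ rows and $2^{2n}$ columns, so explicit construction is only feasible for very small $n$.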

3 QMA-complexity of Disjointness 

In this section we prove that the Disjointness problem Disj requires QMA-communication complexity
$\Omega(n^{1/3})$.
Let us first define the problem.

Definition 6 The Disjointness problem Disj has two $n$-bit strings $x,y$ as inputs.
$\mathrm{Disj}(x,y) = 1 \iff \bigwedge_{i=1,\ldots,n} (\neg x_i \vee \neg y_i)$.
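For concreteness, Definition 6 translates directly into code. In this sketch inputs are tuples of bits and the function name is ours.

```python
def disj(x, y):
    """Disj(x, y) = 1 iff there is no position i with x_i = y_i = 1."""
    return int(all(not (a and b) for a, b in zip(x, y)))

if __name__ == "__main__":
    print(disj((1, 0, 1), (0, 1, 0)))  # the sets {1, 3} and {2} are disjoint
    print(disj((1, 0, 1), (0, 0, 1)))  # both sets contain element 3
```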

Previous results about this problem are: [BFS86] prove $R(\mathrm{Disj}) = \Omega(\sqrt{n})$ (and observe that Disj
is complete for the communication complexity version of co-NP). [KS92] and later [Raz92] prove the
tight $\Omega(n)$ bound. [Raz03] shows the tight $\Omega(\sqrt{n})$ lower bound for quantum protocols. This result
was reproved in a simpler way in [S08b]. [K03] gives an $\Omega(\sqrt{n})$ lower bound for MA-protocols. Finally,
[AW09] show an $O(\sqrt{n} \log n)$ upper bound for MA-protocols, and [AA03] give an $O(\sqrt{n})$ upper bound
for quantum protocols.

Here we prove: 

Theorem 1 $QMA(\mathrm{Disj}) = \Omega(n^{1/3})$.

We are going to give two proofs of this. The first, given here, uses Razborov's method. Following this
(in Section 4) we describe a general method to prove QMA lower bounds, and show how Sherstov's
technique can be used to yield an overall simpler proof (when taking the proof of Razborov's method
into account).

Razborov's method can be summarized as follows [Raz03]; see also [KSW07].

Fact 2 Consider a $c$-qubit quantum communication protocol on $n$-bit inputs $x$ and $y$, with acceptance
probabilities denoted by $p(x,y)$. Define $p(i) = E_{|x|=|y|=n/4,\, |x \wedge y| = i}[p(x,y)]$, where the expectation is
taken uniformly over all $x,y$ that each have weight $n/4$ and that have intersection $i$. For every
$d \le n/4$ there exists a degree-$d$ polynomial $q$ such that $|p(i) - q(i)| \le 2^{-d/4 + 2c}$ for all $i \in \{0, \ldots, n/8\}$.

Proof of Theorem 1. Suppose we are given a QMA-protocol for Disj with communication $c$,
proof length $a \ge 1$, and error 1/3. In what will become a recurring theme, we first reduce the error
to $1/2^{10a}$ by employing Marriott-Watrous boosting (Fact 1). We end up with a protocol
that still has proof length $a$, but now the communication is $c' = O(ac)$. We will show that this
protocol needs communication at least $\Omega(\sqrt{na})$, which implies the theorem.

At this point we simply replace Merlin's proof with the totally mixed state. We end up with an
ordinary quantum protocol that has the following properties:

1. All 1-inputs of Disj are accepted with probability at least $(1 - 2^{-10a})/2^a$.

2. No 0-input of Disj is accepted with probability larger than $1/2^{10a}$.

Now we can simply invoke Fact 2. We set $d = 12c'$. Then we receive a polynomial $q$ such that

1. the degree of $q$ is $d$.

2. $1 + 2^{-c'} \ge q(0) \ge (1 - 2^{-10a})/2^a - 2^{-c'}$.

3. $-2^{-c'} \le q(i) \le 2^{-10a} + 2^{-c'}$ for all $1 \le i \le n/8$.

Now we define a rescaled polynomial $q' = 1 - q/q(0)$.

1. The degree of $q'$ is $d$.

2. $q'(0) = 0$.

3. $1 + 2^{-c'}/(1 + 2^{-c'}) \ge q'(i) \ge 1 - (2^{-10a} + 2^{-c'})/((1 - 2^{-10a})/2^a - 2^{-c'}) \ge 1 - 2^{-8a}$ for all
$1 \le i \le n/8$.
The resulting polynomial must rise very steeply between $q'(0)$ and $q'(1)$. We can apply a result
by Buhrman et al. [BCWZ99], their Theorem 17.

Fact 3 Every polynomial $s$ of degree $d \le M - 1$ such that $s(0) = 0$ and $1 - \epsilon \le s(x) \le 1$ for all
integers $x \in [1, M]$ has

$$\epsilon \ge \frac{1}{u}\, e^{-v d^2/(M-1) - 8d/\sqrt{M}},$$

where $u, v$ are constants.

Setting $M = n/8$, and rescaling $q'$ slightly, we can use this fact to see that $d \ge \Omega(\sqrt{na})$ in order
to enable $\epsilon \le 2^{-\Omega(a)}$.

Hence $c' \ge \Omega(\sqrt{na})$, and $\sqrt{a}\, c \ge \Omega(\sqrt{n})$. This implies that $a + c \ge \Omega(n^{1/3})$, which is our theorem.

□
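The last step of the proof can be spelled out as follows. This is a sketch with all constants suppressed; the case distinction on $a$ is our way of organizing the arithmetic and is not taken from the text.

```latex
\begin{align*}
c' = O(ac) \ \text{and}\ c' \ge \Omega(\sqrt{na})
  &\;\Longrightarrow\; ac \ge \Omega(\sqrt{na})
   \;\Longrightarrow\; \sqrt{a}\, c \ge \Omega(\sqrt{n}). \\
\text{If } a \le n^{1/3}: \quad
  c &\ge \Omega\!\left(\sqrt{n}/\sqrt{a}\right)
     \ge \Omega\!\left(n^{1/2 - 1/6}\right) = \Omega(n^{1/3}). \\
\text{If } a > n^{1/3}: \quad
  a + c &> n^{1/3}.
\end{align*}
```

In both cases $a + c \ge \Omega(n^{1/3})$, and the two regimes balance at $a \approx c \approx n^{1/3}$.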



4 A Lower Bound Method for QMA-protocols 

In this section we develop a general method to prove lower bounds for QMA protocols, and we show
how to use the pattern matrix method [S08b] for QMA-protocols.

4.1 A Discrepancy Measure 

Let us start with the familiar notion of the discrepancy bound in communication complexity (see 
[KN97]). 

Definition 7 The (rectangle) discrepancy of a Boolean function $f$ under a distribution $\mu$ is

$$\mathrm{disc}^\mu(f) = \frac{1}{\max_R |\mu(f^{-1}(0) \cap R) - \mu(f^{-1}(1) \cap R)|},$$

where the maximum is over all rectangles $R$ in the communication matrix.
The discrepancy of $f$ is $\mathrm{disc}(f) = \max_\mu \mathrm{disc}^\mu(f)$.
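For very small functions the quantity in Definition 7 can be computed by brute force. The sketch below assumes the uniform distribution and the reciprocal normalization (discrepancy as 1 over the largest bias of a rectangle); it enumerates all row sets and, for each, picks the best column set directly. Function names are ours.

```python
from fractions import Fraction
from itertools import product

def subsets(items):
    """Yield every subset of items as a list."""
    for mask in product((0, 1), repeat=len(items)):
        yield [it for it, m in zip(items, mask) if m]

def max_rectangle_bias(f, n):
    """max over rectangles R of |mu(f^-1(0) & R) - mu(f^-1(1) & R)| for uniform mu."""
    inputs = list(product((0, 1), repeat=n))
    mu = Fraction(1, len(inputs) ** 2)
    best = Fraction(0)
    for rows in subsets(inputs):
        # Signed column sums over the chosen rows: +1 per 0-input, -1 per 1-input.
        col = [sum(1 - 2 * f(x, y) for x in rows) for y in inputs]
        # For fixed rows the best column set takes all positive or all negative sums.
        pos = sum(c for c in col if c > 0)
        neg = -sum(c for c in col if c < 0)
        best = max(best, mu * max(pos, neg))
    return best

def ip2(x, y):
    """Inner product modulo 2."""
    return sum(a & b for a, b in zip(x, y)) % 2

if __name__ == "__main__":
    bias = max_rectangle_bias(ip2, 2)
    print(bias, 1 / bias)
```

The enumeration takes $2^{2^n}$ row sets, so this is only an illustration for $n \le 3$ or so, not a practical method.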

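The following linear program (see [JK10]) characterizes discrepancy. In the following $\mathcal{R}$ denotes
the set of all rectangles in the communication matrix.

Primal:

$$\min \sum_{R \in \mathcal{R}} w_R + v_R$$
$$\forall (x,y) \in f^{-1}(1) : \sum_{R : (x,y) \in R} w_R - v_R \ge 1,$$
$$\forall (x,y) \in f^{-1}(0) : \sum_{R : (x,y) \in R} v_R - w_R \ge 1,$$
$$\forall R : w_R, v_R \ge 0.$$

Dual:

$$\max \sum_{(x,y)} \mu_{x,y}$$
$$\forall R : \sum_{(x,y) \in f^{-1}(1) \cap R} \mu_{x,y} - \sum_{(x,y) \in R \cap f^{-1}(0)} \mu_{x,y} \le 1,$$
$$\forall R : \sum_{(x,y) \in f^{-1}(0) \cap R} \mu_{x,y} - \sum_{(x,y) \in R \cap f^{-1}(1)} \mu_{x,y} \le 1,$$
$$\forall (x,y) : \mu_{x,y} \ge 0.$$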

The rectangle discrepancy characterizes the weakly unbounded error communication complexity
$PP(f)$, and serves as a lower bound for bounded error quantum and randomized communication.
For the bounded error modes it often yields only very poor results. Here is the relation to
PP-communication complexity [K07].

Fact 4 For all Boolean functions $f : \{0,1\}^n \times \{0,1\}^n \to \{0,1\}$ we have $PP(f) \ge \Omega(\log \mathrm{disc}(f))$ and
$PP(f) \le O(\log \mathrm{disc}(f) + \log n)$.
Now we define a lower bound method that we will use for QMA-communication complexity, the
one-sided smooth discrepancy. It is similar to the smooth discrepancy [K07, S08b], in which the
primal linear program for discrepancy is augmented with additional upper and lower bounds. Here
we augment the program only for the 0-inputs. Essentially the same method (in what we later define
as its 'natural' version) was used recently by Gavinsky and Sherstov in the setting of multiparty
protocols and MA-communication [GS10].

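Definition 8 (One-Sided Smooth Discrepancy) Let $f : \{0,1\}^n \times \{0,1\}^n \to \{0,1\}$ be a Boolean
function. The one-sided smooth discrepancy of $f$, denoted $\mathrm{sdisc}_\epsilon(f)$, is given by the optimal value of
the following linear program.

Primal:

$$\min \sum_{R \in \mathcal{R}} w_R + v_R$$
$$\forall (x,y) \in f^{-1}(1) : \sum_{R : (x,y) \in R} w_R - v_R \ge 1,$$
$$\forall (x,y) \in f^{-1}(0) : 1 + \epsilon \ge \sum_{R : (x,y) \in R} v_R - w_R \ge 1,$$
$$\forall R : w_R, v_R \ge 0.$$

Dual:

$$\max \sum_{(x,y)} \mu_{x,y} - (1 + \epsilon) \phi_{x,y}$$
$$\forall R : \sum_{(x,y) \in f^{-1}(1) \cap R} \mu_{x,y} - \sum_{(x,y) \in R \cap f^{-1}(0)} (\mu_{x,y} - \phi_{x,y}) \le 1,$$
$$\forall R : \sum_{(x,y) \in f^{-1}(0) \cap R} (\mu_{x,y} - \phi_{x,y}) - \sum_{(x,y) \in R \cap f^{-1}(1)} \mu_{x,y} \le 1,$$
$$\forall (x,y) : \mu_{x,y} \ge 0;\ \phi_{x,y} \ge 0.$$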

Note that for all $(x,y) \in f^{-1}(1)$ we have $\phi_{x,y} = 0$ in an optimal solution. We are now looking for a
'natural' definition of one-sided smooth discrepancy, i.e., a definition in which the one-sided smooth
discrepancy of a function $f$ is related to the discrepancy of a function $g$ that is similar to $f$. The
value of one-sided smooth discrepancy will be the discrepancy of $g$ under a distribution $\lambda$. The above
dual shows us that we should have $f^{-1}(1) \subseteq g^{-1}(1)$. Furthermore, not too many 0-inputs of $f$ should
be 1-inputs of $g$. It is also quite easy to see that for no input $\phi_{x,y} > 0$ and $\mu_{x,y} > 0$ hold simultaneously
in an optimal solution to the dual.

Below we present the natural definition of one-sided smooth discrepancy. 

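Definition 9 (One-sided Smooth Discrepancy, Natural Definition) Let $f : \{0,1\}^n \times \{0,1\}^n \to \{0,1\}$
be a Boolean function. The $\delta$-one-sided smooth discrepancy of $f$, denoted $\widetilde{\mathrm{sdisc}}_\delta(f)$, is defined
as follows:

$$\widetilde{\mathrm{sdisc}}_\delta(f) = \max\{\widetilde{\mathrm{sdisc}}^\lambda_\delta(f) : \lambda \text{ a distribution on } \{0,1\}^n \times \{0,1\}^n\}.$$

$$\widetilde{\mathrm{sdisc}}^\lambda_\delta(f) = \max\{\mathrm{disc}^\lambda(g) \text{ such that } g : \{0,1\}^n \times \{0,1\}^n \to \{0,1\},\ f^{-1}(1) \subseteq g^{-1}(1) \text{ and } \lambda(f^{-1}(1)) \ge \delta \cdot \lambda(g^{-1}(1))\}.$$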

The following lemma shows the equivalence of the two definitions of one-sided smooth discrepancy. 



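Lemma 1 Let $f : \{0,1\}^n \times \{0,1\}^n \to \{0,1\}$ be a function and let $\delta > 0$. Then

1. $\widetilde{\mathrm{sdisc}}_{\delta/3}(f) \ge \mathrm{sdisc}_\delta(f)$.

2. $\delta \cdot \widetilde{\mathrm{sdisc}}_{2\delta}(f) \le \mathrm{sdisc}_\delta(f)$.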



Proof.

1. Suppose $\mathrm{sdisc}_\delta(f) \ge K$. Then there is a solution $\mu, \phi$ to the dual program that achieves value
$K$. We have to define a function $g$ and a distribution $\lambda$. W.l.o.g. we can assume that for every
$x,y$ either $\mu_{x,y} = 0$ or $\phi_{x,y} = 0$. We define $g(x,y) = f(x,y)$ when $\phi_{x,y} = 0$, else $g(x,y) = 1$.
Clearly, $f^{-1}(1) \subseteq g^{-1}(1)$. Furthermore define $L = \sum_{(x,y)} \mu_{x,y} + \phi_{x,y}$. Set $\lambda_{x,y} = (\mu_{x,y} + \phi_{x,y})/L$.
The constraints of the dual now imply that $\mathrm{disc}^\lambda(g) \ge L \ge K$.

Denote $\mu_0 = \sum_{x,y : f(x,y)=0} \mu_{x,y}/L$, $\mu_1 = \sum_{x,y : f(x,y)=1} \mu_{x,y}/L$, and $\phi = \sum_{x,y} \phi_{x,y}/L$. Since
the set of all inputs is a rectangle, we have that $\mu_0 \le 1/2 + 1/L$, and $1/2 + 1/L \ge \mu_1 + \phi \ge 1/2 - 1/L$.

Assume that $\lambda(f^{-1}(1)) < (\delta/3) \cdot \lambda(g^{-1}(1))$. This means that $\mu_1 < (\delta/3) \cdot (\mu_1 + \phi) \le (\delta/6) + 1/L$.
Then

$$\mu_1 + \mu_0 - (1 + \delta)\phi < \delta/6 + 1/2 - (1 + \delta)(1/2 - \delta/6) + 3/L < 0,$$

and the objective function would be negative, which is impossible.

2. Suppose $\widetilde{\mathrm{sdisc}}_{2\delta}(f) \ge K$. Then there are a function $g$ and a distribution $\lambda$ as in the definition
of $\widetilde{\mathrm{sdisc}}_{2\delta}$. For all $x,y$ with $g(x,y) = 1$ but $f(x,y) = 0$ we set $\phi_{x,y} = \lambda_{x,y} \cdot K$, for all other $x,y$
we set $\mu_{x,y} = \lambda_{x,y} \cdot K$. All other variables are set to 0.

Clearly, all constraints of the dual are satisfied. Define $\mu_0 = \sum_{x,y : f(x,y)=0} \mu_{x,y}$, $\mu_1 = \sum_{x,y : f(x,y)=1} \mu_{x,y}$,
and $\phi = \sum_{x,y} \phi_{x,y}$. Because the set of all inputs is a rectangle, we have that
$\mu_0 \ge K/2 - 1$, and $K/2 + 1 \ge \mu_1 + \phi \ge K/2 - 1$. The objective function is

$$\mu_1 + \mu_0 - (1 + \delta)\phi \ge \delta K - 1 + K/2 - 1 - (1 + \delta)(K/2 + 1 - \delta K) \ge \delta K.$$

□

We can now show that the one-sided smooth discrepancy yields lower bounds for QMA-communication 
complexity. 

Theorem 2 Let $QMA^{(a)}(f) \le c$. Then $\log \mathrm{sdisc}_{2^{-10a}}(f) \le O((a + 1)c)$.
Furthermore, $QMA(f) \ge \Omega\left(\sqrt{\log \mathrm{sdisc}_1(f)}\right)$.

One immediate corollary is a lower bound for the function Inner Product mod 2 (IP2), because it
is well known that even the discrepancy of IP2 is at most $2^{-\Omega(n)}$ [CG85].

Corollary 1 $QMA(\mathrm{IP2}) \ge \Omega(\sqrt{n})$.

Note that this lower bound is tight within a log factor due to the MA-protocol of Aaronson and 
Wigderson [AW09]. 

Proof of Theorem 2. Suppose we have a QMA-protocol with proof length $a \ge 1$, communication
$c$, and error 1/3, with $a + c$ optimal. We boost the success probability using the Marriott-Watrous
technique (Fact 1). This gives us a QMA-protocol with proof length $a$, communication $c' \le O(ca)$,
and error $2^{-13a}$. We replace the proof at this point with the totally mixed state, leaving us with
a quantum protocol that accepts all 1-inputs with probability at least $(1 - 2^{-13a})/2^a$, and accepts
0-inputs with probability at most $2^{-13a}$.

Our goal is to show, that this gives us a solution to the linear program for one-sided smooth 
discrepancy. We consider the matrix of acceptance probabilities. 
Definition 10 For a matrix $M$ with real entries, denote by $\mu(M)$ the minimum of $\sum_R |u_R|$ (where $R$
ranges over all rectangles in the matrix $M$) such that $M = \sum_R u_R \cdot f_R$, where $f_R$ is the characteristic
function of rectangle $R$.

Linial and Shraibman [LS09] have shown the following.

Fact 5 If $M$ is the matrix of acceptance probabilities of a quantum protocol (with shared
entanglement) with communication $c$, then $\mu(M) \le O(2^c)$.

So let $M$ be the matrix we constructed before (i.e., for 1-inputs $x,y$ the entry at position $x,y$ is
between $2^{-a}(1 - 2^{-13a})$ and 1, and for 0-inputs between 0 and $2^{-13a}$). Consider the system of weights
$u_R$ that achieve $\mu(M) \le O(2^{c'})$.

To turn this into a solution for our primal program for the one-sided smooth discrepancy bound
we can simply multiply all the $u_R$ by a factor of $2^{a+2}$ and subtract $1 + 2^{-12a+2}$ from the $u_R$ for
the rectangle $R = \{0,1\}^n \times \{0,1\}^n$. Then, for all $R$, when $u_R < 0$ we set $v_R = -u_R$ and $w_R = 0$,
otherwise we set $w_R = u_R$ and $v_R = 0$. The result is a feasible solution with the parameter
$\epsilon \le 2^{-12a+2} \le 2^{-10a}$, and the cost of the linear program is at most $O(2^a \cdot 2^{c'}) \le 2^{O((a+1)c)}$. Hence
$\log \mathrm{sdisc}_{2^{-10a}}(f) \le O((a + 1)c)$.

□



4.2 Proving Lower Bounds for Pattern Matrices 

In this section we follow the approach of Sherstov [S08b], which can be summarized as follows: for
a Boolean function $f : \{0,1\}^n \to \{0,1\}$, we can define a communication problem $P_f$, the pattern
matrix (see Section 2.4), and then lower bound the communication complexity of $P_f$ in terms of some
parameter of the function $f$. Sherstov uses mainly the approximate degree of $f$ to get lower bounds
on the smooth discrepancy of the function $P_f$ (and hence its quantum communication complexity).

Our goal is to relate the one-sided smooth discrepancy of $f$, redefined for query problems, to the
one-sided smooth discrepancy of $P_f$. Thanks to the natural definition of one-sided smooth discrepancy
(Definition 9) it is actually sufficient to relate the discrepancies of $f$ and $P_f$. In the next section we
define discrepancy measures for query complexity.

4.3 Another Notion of Discrepancy 

In this section we define a notion of discrepancy for Boolean functions (which can be used as a lower 
bound for query complexity, and in fact characterizes PP-query complexity). Here subcubes defined 
by partial assignments (see Section 2.2) take the role of the rectangles. 
We define the query complexity version of discrepancy as follows. 

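Definition 11 (Discrepancy) Let $f : \{0,1\}^n \to \{0,1\}$ be a function. The (polynomial) discrepancy
of $f$, denoted $\mathrm{pdisc}(f)$, is given by the optimal value of the following linear program.

Primal:

$$\min\ [\ldots]$$
$$\forall A : w_A, v_A \ge 0.$$

Dual:

$$\max \sum_x \mu_x$$
$$\forall A : \sum_{x \in f^{-1}(1) \cap A} \mu_x - \sum_{x \in A,\, x \notin f^{-1}(1)} \mu_x \le [\ldots],$$
$$\forall A : \sum_{x \in A,\, x \notin f^{-1}(1)} \mu_x - \sum_{x \in A \cap f^{-1}(1)} \mu_x \le [\ldots],$$
$$\forall x : \mu_x \ge 0.$$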
We only give the 'natural' definition of the corresponding notion of one-sided smooth discrepancy.
The linear programs are easy to state and analogous to the communication complexity versions.

Definition 12 Let $f : \{0,1\}^n \to \{0,1\}$ be a Boolean function. The $\delta$-one-sided smooth (polynomial)
discrepancy of $f$, denoted $\mathrm{spdisc}_\delta(f)$, is defined as follows:

$$\mathrm{spdisc}_\delta(f) = \max\{\mathrm{spdisc}^\lambda_\delta(f) : \lambda \text{ a distribution on } \{0,1\}^n\}.$$

$$\mathrm{spdisc}^\lambda_\delta(f) = \max\{\mathrm{pdisc}^\lambda(g) \text{ such that } g : \{0,1\}^n \to \{0,1\},\ f^{-1}(1) \subseteq g^{-1}(1) \text{ and } \lambda(f^{-1}(1)) \ge \delta \cdot \lambda(g^{-1}(1))\}.$$

Essentially this measure looks at a combination of threshold weight and degree of polynomials 
that sign-represent a function with a large enough gap, but without requiring integer coefficients. 
We now want to relate the query complexity version and the communication complexity version of 
discrepancy. Sherstov proved the following statement [S08b].

Fact 6 Let $P_f$ be the pattern matrix of a function $f$, and $d$ a positive integer.
Then 



where $W(f,d)$ denotes the threshold weight of $f$ with degree $d$, i.e., the minimum threshold weight of
a polynomial with integer coefficients and degree $d$ that sign-represents $f$.

Refer to Section 2.2 for an explanation of threshold weight as used here (which is slightly different
from Sherstov's usage, leading to an extra factor of $2^d$). Inspecting the proof of Fact 6, it becomes
clear that the integrality of the polynomial coefficients in the definition of $W(f,d)$ is only used to
ensure that the gap between the value of a polynomial on 1-inputs and 0-inputs is at least 1. It turns
out we can replace $W(f,d-1)$ by pdisc. Furthermore, since the linear program for pdisc already
incorporates the factors $2^{|A|}$, the minimum with $2^d$ is unnecessary. An additional advantage of this is
that the program for pdisc (being linear) has a proper dual, whereas Sherstov's proof works with an
approximate 'dual' of $W(f,d)$ proved in a combinatorial way (leading to the square on the lhs and
the factor $1/n$ on the rhs, see his Theorem 3.4 in [S08b]). Modifying his proof this way yields the
following theorem.

Theorem 3 Let $P_f$ be the pattern matrix of a function $f$. Then

$$\mathrm{disc}^{\mu_\lambda}(P_f) \ge \Omega(\mathrm{pdisc}^\lambda(f)),$$

where $\mu_\lambda$ is a distribution in which the inputs $y, z$ to the communication problem $P_f$ (see Section
2.4) are chosen uniformly, and the input bits $x(y)$ are chosen such that $x(y) \oplus z$ is distributed as in
$\lambda$. The remaining bits in $x$ are uniform.

Thanks to the natural definitions of one-sided smooth discrepancy we get an analogous result for
one-sided smooth discrepancy. Sherstov's technique allows us to transfer a discrepancy lower bound for
the function $g$ (from the natural definition of one-sided smooth discrepancy of $f$) to a lower bound
on the discrepancy of $P_g$. $P_g$ (together with the distribution $\mu_\lambda$) can then be used as a witness for
the hardness of $P_f$.

Theorem 4 Let $P_f$ be the pattern matrix of a function $f$. Then

$$\mathrm{sdisc}^1_\delta(P_f) \ge \Omega(\delta \cdot \mathrm{spdisc}^1_{2\delta}(f)).$$

Proof. When $\mathrm{spdisc}^1_{2\delta}(f) = K$, then there are a function $g$ and a distribution $\lambda$ as in the definition of
$\mathrm{spdisc}^1_{2\delta}$. Then $\mathrm{disc}^{\mu_\lambda}(P_g) \ge \Omega(\mathrm{pdisc}^\lambda(g))$ by Theorem 3, and by the definition of $\mu_\lambda$ and the properties
of $\lambda, f, g$ we have $\mu_\lambda(P_f^{-1}(1)) \ge 2\delta \cdot \mu_\lambda(P_g^{-1}(1))$.

Then $\mathrm{sdisc}^1_{2\delta}(P_f) \ge \Omega(K)$, and consequently $\mathrm{sdisc}^1_\delta(P_f) \ge \Omega(\delta K)$ with Lemma 1.

□ 

Hence it is enough to analyze the one-sided smooth discrepancy of $f$ to lower bound the QMA-communication complexity of $P_f$.

4.4 The One-sided Smooth Discrepancy of AND 

The AND function is defined by $\mathrm{AND}(x_1, \ldots, x_n) = x_1 \wedge \cdots \wedge x_n$. $P_{\mathrm{AND}}$ contains the communication
matrix of Disj as a submatrix [S08b].

Lemma 2 $\log \mathrm{spdisc}^1_{1/3}(\mathrm{AND}) = \Omega(\sqrt{n})$.
$\log \mathrm{spdisc}^1_{2^{-a}}(\mathrm{AND}) = \Omega(\sqrt{an})$.

The proof of this lemma is analogous to the corresponding part of the proof of Theorem 1 (and
follows from results in [BCWZ99]).

Corollary 2 $\mathrm{QMA}(P_{\mathrm{AND}}) = \Omega(n^{1/3})$.

5 AM- vs. PP-communication 

In this section we describe a problem whose PP-communication complexity is exponentially
larger than both its AM-communication complexity and its co-AM-communication complexity. We
can then easily show that the QMA-communication complexity of the problem must also be large. The
lower bound for PP-communication complexity also implies that the discrepancy method cannot be
applied to get AM-communication complexity lower bounds. This essentially means that to lower
bound AM-communication complexity, it is not sufficient to study the properties (size and error) of
individual rectangles. In other words, we describe a problem for which, for all distributions on the inputs,
there is an $O(\log n)$ nondeterministic protocol (i.e., a poly(n) size cover of the 1-inputs) with constant
error, whereas there is a distribution under which each rectangle has exponentially small discrepancy,
i.e., all rectangles are either exponentially small, or they have error exponentially close to 1/2.

5.1 The Problem 

In [V95] Vereshchagin describes a similar separation for query complexity. We start with his Boolean
function, which is a relaxed version of the Minsky-Papert function [MP88].

Definition 13 Let $M$ be a matrix in $\{0,1\}^{n \times m}$. $M$ is good if every row of $M$ contains a 1. $M$ is
$\delta$-bad if at least $\delta n$ of its rows contain only zeros.

The function AppMP takes such matrices $M$ as inputs, accepts good matrices, rejects $\delta$-bad matrices, and is undefined on all other matrices.
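As an illustration only, the promise in Definition 13 can be written as a small Python function. The name `app_mp`, the list-of-rows encoding of $M$, and the use of `None` for inputs outside the promise are assumptions made for this sketch, not notation from [V95].

```python
from typing import List, Optional


def app_mp(M: List[List[int]], delta: float) -> Optional[int]:
    """Hypothetical helper: evaluate the partial function AppMP on a 0/1 matrix.

    Returns 1 if M is good (every row contains a 1), 0 if M is delta-bad
    (at least delta * n rows are all-zero), and None outside the promise.
    """
    n = len(M)
    zero_rows = sum(1 for row in M if not any(row))
    if zero_rows == 0:
        return 1
    if zero_rows >= delta * n:
        return 0
    return None
```

For example, with `delta = 0.5` a 4-row matrix with exactly one all-zero row is neither good nor 1/2-bad, so the sketch returns `None` for it.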

We will fix $m = 4n^2$ and $\delta = 1/2$ in this paper. So the input size of the problem is $N = 4n^3$.

Since the complement of the function AppMP is not necessarily easy to compute by an AM-query 
algorithm, Vereshchagin defines a function for which also its complement is easy. 






Definition 14 The function AppMPC takes pairs of Boolean $n \times n$ matrices $M, M'$ as inputs. If $M$
is good, and $M'$ is 2/3-bad, then AppMPC$(M, M') = 1$, and if $M$ is 2/3-bad and $M'$ is good, then
AppMPC$(M, M') = 0$. In all other cases the function is undefined.

We can now state the main result from [V95] in our terminology.

Fact 7 1. For any polynomial with integer coefficients with degree d < n/2 that sign-represents 
the function AppMP, the threshold weight is at least 0.5e"/^^, i.e., H^(AppMP, n/2) > 0.5e"/^^. 

2. There is a constant ^ such that for any polynomial with integer coefficients with degree Q^/n that 
sign-represents AppMPC, the threshold weight is at least 2^", i.e., W^(AppMPC, Cv^) > 2^". 

We now define communication complexity versions of these problems via pattern matrices. 

Definition 15 The function PAppMP is the communication problem defined by the pattern matrix 
of the Boolean function AppMP. The function PAppMPC is the communication problem defined by 
the pattern matrix of the Boolean function AppMPC. 

We now state the obvious fact that AM-communication is small for PAppMPC and PAppMP. 

Lemma 3 1. AM(PAppMP) $= O(\log n)$.

2. AM(PAppMPC), AM(¬PAppMPC) $= O(\log n)$.

Proof. In an AM-protocol for PAppMP, the public coin random number $i$ represents a random row
of the matrix $M$, which is the input to the function AppMP encoded in the pattern matrix. Merlin
replies with a position $j$ such that $M(i,j) = 1$, if such a $j$ exists. Alice and Bob can easily verify with
logarithmic communication whether $M(i,j) = 1$. They accept if this is the case. Clearly, if the $i$-th
row of $M$ does not contain a 1 they will not accept any proof. On $\delta$-bad matrices this happens with
probability at least $\delta$. For good matrices $M$, on the other hand, the corresponding inputs to the communication
problem are accepted with probability 1. The remaining protocols are along the same lines. □
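The protocol in the proof can be sketched at the level of the matrix $M$ alone. This sketch deliberately ignores the pattern matrix encoding and the actual communication between Alice and Bob, and all function names are hypothetical. Since the verifiers accept only if $M(i,j) = 1$, the best Merlin can do is point at a 1 in row $i$, so the acceptance probability equals the fraction of rows that contain a 1.

```python
import random
from fractions import Fraction
from typing import List, Optional


def merlin_reply(M: List[List[int]], i: int) -> Optional[int]:
    """Honest Merlin: return a column j with M[i][j] == 1, or None if row i is all-zero."""
    for j, bit in enumerate(M[i]):
        if bit == 1:
            return j
    return None


def verifiers_accept(M: List[List[int]], i: int, j: Optional[int]) -> bool:
    """Alice and Bob check the single entry M[i][j] (O(log n) communication in the real protocol)."""
    return j is not None and M[i][j] == 1


def run_once(M: List[List[int]], rng: random.Random) -> bool:
    """One run: the public coin picks a row i, Merlin answers, the verifiers check."""
    i = rng.randrange(len(M))
    return verifiers_accept(M, i, merlin_reply(M, i))


def acceptance_probability(M: List[List[int]]) -> Fraction:
    """Exact acceptance probability against the best possible Merlin."""
    return Fraction(sum(1 for row in M if any(row)), len(M))
```

On a good matrix `acceptance_probability` is 1, and on a $\delta$-bad matrix it is at most $1 - \delta$, matching the completeness and soundness claims in the proof.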

We now turn to the lower bound. As it happens, all we really need to do is to (again) appeal to 
a result by Sherstov [S08b] (restated here for convenience). 

Fact 8 Let $P_f$ be the pattern matrix of a function $f$, and $d$ a positive integer.
Then 



where $W(f,d)$ denotes the threshold weight of $f$ with degree $d$, i.e., the minimum threshold weight of
a polynomial with integer coefficients and degree $d$ that sign-represents $f$.

See Section 2.2 for an explanation of threshold weight. Putting these results together we find
that for $d = n/100$ the function AppMP has threshold weight at least $2^{\Omega(n)}$, and then the pattern
matrix has discrepancy at least $2^{\Omega(n)}$. This readily implies that PP(PAppMP) $= \Omega(n)$ via the relation between discrepancy and PP-communication complexity.
Similarly, choosing $d$ of order $\sqrt{n}$ as in part 2 of Fact 7 we get PP(PAppMPC) $\ge \Omega(\sqrt{n})$.

Theorem 5 1. PP(PAppMP) $= \Omega(N^{1/3})$.

2. PP(PAppMPC) $= \Omega(N^{1/4})$.

3. AM(PAppMP) $= O(\log N)$.

4. AM(PAppMPC), AM(¬PAppMPC) $= O(\log N)$.

5.2 QMA vs. AM



In this subsection we note the following consequence of the lower bound in the previous subsection,
which follows from the fact that PP-protocols can simulate QMA-protocols within a quadratic increase
in communication. This can be proved by first boosting the success probability and then removing the proof, which
leaves a quantum protocol with a large enough gap. This can be turned into a weakly unbounded
error quantum protocol, and such protocols are exactly as powerful as classical weakly unbounded
error protocols [K07].

Corollary 3 1. QMA(PAppMP) $= \Omega(N^{1/6})$.
2. AM(PAppMP) $= O(\log N)$.

6 Rounds in MA-communication 

For many 'realistic' modes of communication complexity there are problems that require the players
Alice and Bob to interact by using many rounds of communication in order to achieve good protocols. This is not altogether surprising, since one would expect conversations with many rounds of
interaction to be more powerful than monologues. Examples of this phenomenon are deterministic,
randomized (see [NW93]), and quantum communication complexity (see [KNTZ07, JRS02]).

However, in the nondeterministic mode of communication, monologues are in fact optimal: the 
prover can provide the whole conversation to the players, who now just need to verify that their role 
in the conversation is represented correctly. 

In this section we show that there is a partial function for which every one-way MA-protocol
with communication going from Alice to Bob is exponentially more expensive than a randomized
one-way protocol with communication going from Bob to Alice. Note that such problems trivially
do not exist when we replace MA- with nondeterministic or AM-communication complexity. Raz
and Shpilka [RS04] prove that one-way communication (in any direction) is also optimal (within a
polynomial increase in communication) for QMA protocols.^1 Rounds in MA-communication do not
seem to have been considered before, although Aaronson [A06] considers a weaker variant of one-way
MA-protocols: Merlin sends the proof to Bob only, so that Alice has to send her message without
having seen the proof. This model is much weaker than the standard one-way MA-communication
model, in fact at most quadratically more efficient than randomized one-way communication.

Let us define the function. A usual suspect for this kind of separation is the Index function Ix,
for which Alice receives a string $x \in \{0,1\}^n$, and Bob an index $i \in \{1, \ldots, n\}$, and the goal is to
compute $x_i$ (see [KNR99]). But due to Bob's input being short, the nondeterministic complexity
of this problem is small, and hence also the one-way MA-communication: the prover can simply
provide Alice with Bob's input $i$. The problem we use instead gives Bob many indices, and we are
trying to determine whether for many of them $x_{i_j} = 1$.

Definition 16 The function $\mathrm{MajIx}(x, I)$, where $I = \{i_1, \ldots, i_{\sqrt{n}}\}$, each $i_j \in \{1, \ldots, n\}$, and $x \in \{0,1\}^n$, is defined as follows:

1. if $|\{j : x_{i_j} = 1\}| = \sqrt{n}$ then $\mathrm{MajIx}(x, I) = 1$,

2. if $|\{j : x_{i_j} = 1\}| < 0.9\sqrt{n}$ then $\mathrm{MajIx}(x, I) = 0$,

3. otherwise $\mathrm{MajIx}(x, I)$ is undefined.
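The following is a minimal sketch of Definition 16 as a Python function. It assumes 0-based indices, that $n$ is a perfect square, and that inputs outside the promise are reported as `None`; the name `maj_ix` is hypothetical.

```python
import math
from typing import Optional, Sequence, Set


def maj_ix(x: Sequence[int], I: Set[int]) -> Optional[int]:
    """Hypothetical helper: the partial function MajIx on a bit string x and index set I."""
    n = len(x)
    k = math.isqrt(n)
    if k * k != n or len(I) != k:
        raise ValueError("expected n to be a perfect square and |I| = sqrt(n)")
    ones = sum(x[i] for i in I)  # number of selected positions holding a 1
    if ones == k:
        return 1
    if 10 * ones < 9 * k:  # integer form of ones < 0.9 * sqrt(n)
        return 0
    return None  # outside the promise
```

With $n = 100$ and ten indices, nine selected ones fall in the gap between the two cases, so the function is undefined there.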

^1 They prove that every problem with QMA-communication complexity $c$ can be reduced to a problem called LSD
of size $2^{\mathrm{polylog}\, c}$, for which a logarithmic QMA one-way protocol exists (they call this a two round protocol).






It is easy to see that $R^{B \to A}(\mathrm{MajIx}) = O(\log n)$, because Bob can just pick 100 indices $i_j$ from $I$
randomly, and send them to Alice, who accepts if and only if all $x_{i_j} = 1$. In AM-protocols a single
round of communication from Alice to Bob is always optimal (and finding such a protocol for MajIx
is an easy exercise).
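The Bob-to-Alice protocol just described can be sketched as follows. The sketch assumes 0-based indices and sampling with replacement (the text only says "randomly"), and the function names are hypothetical. On a 1-input Alice always accepts, and on a 0-input each sampled index hits a 0 with probability more than 0.1, so she accepts with probability below $0.9^{100}$.

```python
import random
from typing import List, Sequence, Set


def bob_message(I: Set[int], rng: random.Random, samples: int = 100) -> List[int]:
    """Bob samples indices from his set I (with replacement, an assumption of this sketch)."""
    pool = sorted(I)
    return [rng.choice(pool) for _ in range(samples)]


def alice_accepts(x: Sequence[int], message: Sequence[int]) -> bool:
    """Alice accepts iff every index she received points at a 1 in her string x."""
    return all(x[i] == 1 for i in message)


def run_protocol(x: Sequence[int], I: Set[int], rng: random.Random, samples: int = 100) -> bool:
    """One execution of the one-way protocol from Bob to Alice."""
    return alice_accepts(x, bob_message(I, rng, samples))


def message_bits(n: int, samples: int = 100) -> int:
    """Length of Bob's message in bits: a constant number of indices of about log2(n) bits each."""
    return samples * max(1, (n - 1).bit_length())
```

The message length is a constant number of indices, each of logarithmic size, which gives the $O(\log n)$ bound.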

Intuitively, Merlin's problem with Majlx is that he cannot provide information about many indices 
in /, unless his proof is very long. However, it is not clear at all that this is necessary, and indeed, the 
same intuition would apply to the quantum case, in which there is a one-way protocol with poly log(n) 
communication and proof length for the problem. We prove the following lower bound, which is close 
to optimal. 

Theorem 6 $MA^{A \to B}(\mathrm{MajIx}) \ge \Omega(\sqrt{n})$.

Proof. Before starting let us define the notion of a one-way rectangle, which is a basic object when 
considering randomized one-way communication complexity. 

Definition 17 For a (partial) function $f : X \times Y \to \{0,1\}$ a one-way rectangle is a subset $R \subseteq X$,
coupled with a function $d_R : Y \to \{0, 1, -1\}$. The one-way rectangle accepts all inputs $(x,y) \in R \times Y$
with $d_R(y) = 1$, rejects all inputs $(x,y) \in R \times Y$ with $d_R(y) = 0$, and is undecided about the remaining
inputs in $R \times Y$. The error of a one-way rectangle under some distribution is defined in the obvious
way.

The size of small error one-way rectangles under distributions on the inputs characterizes the
one-way randomized communication complexity [K04, N08].

Now let us begin with the proof. We are given an $MA^{A \to B}$ protocol, which has proof length at
most $a$, communication at most $c$, and error 1/3. We assume that $a, c \le \gamma\sqrt{n}$ for some small
constant $\gamma$. As usual we boost the success probability by repeating the communication among Alice
and Bob $100\sqrt{n}$ times, so that the (soundness and completeness) error drops to $\epsilon = 2^{-10\sqrt{n}}$. Note
that the proof does not need to be repeated, so after this step the proof length is still $a$, and the
communication is $c' \le \delta n$ for some small constant $\delta$.^2

Our argument can now be summarized as follows: First we consider a distribution $\mu$ on 1-inputs.
We find and fix a proof for which many 1-inputs are accepted with high probability (we will identify
proofs with the sets of 1-inputs that are accepted with high probability when using those proofs).
After fixing the proof we are left with a randomized one-way protocol that accepts all 1-inputs in
the proof with probability $1 - \epsilon$, but accepts 0-inputs with probability at most $\epsilon$ each. Now we define
a distribution $\sigma$ on 0-inputs. Finally, we show that under the distribution in which 1-inputs are
chosen according to $\mu$ and 0-inputs according to $\sigma$, any large one-way rectangle must have large error.
This shows that $c' = \Omega(n)$, and hence $a + c = \Omega(\sqrt{n})$.

So let us begin with the distribution $\mu$ on 1-inputs. For this we employ a good error-correcting
code to generate $x \in \{0,1\}^n$. It does not matter whether the code is constructible or not, so a
randomized construction suffices. A simple modification of the Gilbert-Varshamov bound gives us a
code $C \subseteq \{0,1\}^n$ that has distance $n/4$ and $2^{\Omega(n)}$ codewords, each of which has Hamming weight
exactly $n/2$.

For the distribution $\mu$ we first uniformly choose an element $x \in C$ from the code. Denote by $\mathcal{I}$
the set of all sets $I$ of $\sqrt{n}$ different indices from $\{1, \ldots, n\}$. We continue by choosing an index set
$I = \{i_1, \ldots, i_{\sqrt{n}}\} \in \mathcal{I}$, under the condition that all of the $i_j \in I$ satisfy $x_{i_j} = 1$. This finishes the
description of $\mu$.
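To make the distribution on 1-inputs concrete, here is a small sketch that builds a code greedily from random words of weight $n/2$ and then samples a pair $(x, I)$. The greedy construction, the function names, the 0-based indexing, and the uniform choice of $I$ inside the support of $x$ are assumptions of this sketch; the text only requires that some code with distance $n/4$ and weight-$n/2$ codewords exists.

```python
import math
import random
from typing import List, Set, Tuple

Word = Tuple[int, ...]


def random_half_weight_word(n: int, rng: random.Random) -> Word:
    """A uniformly random word of length n with Hamming weight exactly n // 2."""
    ones = set(rng.sample(range(n), n // 2))
    return tuple(1 if i in ones else 0 for i in range(n))


def hamming(u: Word, v: Word) -> int:
    return sum(a != b for a, b in zip(u, v))


def greedy_code(n: int, size: int, rng: random.Random, max_tries: int = 100000) -> List[Word]:
    """Collect up to `size` weight-n/2 words with pairwise distance at least n/4."""
    code: List[Word] = []
    tries = 0
    while len(code) < size and tries < max_tries:
        tries += 1
        w = random_half_weight_word(n, rng)
        if all(hamming(w, c) >= n / 4 for c in code):
            code.append(w)
    return code


def sample_one_input(code: List[Word], rng: random.Random) -> Tuple[Word, Set[int]]:
    """Pick x uniformly from the code, then sqrt(n) distinct indices where x is 1."""
    x = rng.choice(code)
    k = math.isqrt(len(x))
    support = [i for i, bit in enumerate(x) if bit == 1]
    return x, set(rng.sample(support, k))
```

Every sampled pair is a 1-input of MajIx by construction, since all selected positions of $x$ hold a 1.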

^2 In the quantum case this appears to be impossible, because the Marriott-Watrous boosting technique does not work
for one-way protocols. Hence here is a point where a $QMA^{A \to B}$ lower bound along these lines would fail.

In our MA-protocol there are at most $2^a$ different proofs $p$. We identify each proof $p$ with the set
of 1-inputs $(x, I)$ for which Alice and Bob accept $(x, I)$ with probability at least $1 - \epsilon$ when given
proof $p$. Since completeness is $1 - \epsilon$, every 1-input is in at least one proof. No 0-input is in any proof.
Now we simply fix the largest proof $p$ under the distribution $\mu$. Then $\mu(p) \ge 2^{-a}$.

Having fixed our proof $p$, we are left with a randomized one-way protocol that accepts at least the
1-inputs in $p$ with probability $1 - \epsilon$, and accepts no 0-input with probability larger than $\epsilon$. In order
to show that this protocol needs communication $\Omega(n)$ we create a hard distribution on all inputs, by
mixing $\mu$ with a distribution on 0-inputs: this distribution $\sigma$ is simply uniform on all 0-inputs.

Note that there are more 0-inputs than 1-inputs to the function MajIx, due to the promise definition. However, when we denote the total number of 1-inputs $(x, I)$ with $x \in C$ by $K$ and the total
number of 0-inputs $(x, I)$ with $x \in C$ by $L$, then a simple calculation reveals that $L/K \le 3^{\sqrt{n}}$. We
can conclude that for each 0-input $(x, I)$ and each 1-input $(x', I') \in p$ we have

$$\mu(x', I') \le 3^{\sqrt{n}} \cdot \sigma(x, I). \qquad (1)$$

Let $\nu$ be the distribution on all inputs, which results from mixing $\mu$ and $\sigma$ with probability 1/2
each. We may now fix the remaining randomness in our protocol and get a deterministic one-way
protocol that has communication $c'$, and under $\nu$ accepts a set $p' \subseteq p$ of 1-inputs with $\nu(p') \ge 2^{-a} \cdot (1 - \epsilon)/2$, but accepts a set $q$ of 0-inputs with $\nu(q) \le \epsilon/2$. Note that $\epsilon < 2^{-a}(1 - \epsilon)$.

To simplify our argument we will remove the 1-inputs that have limited contribution to the size
of $p'$. A string $x \in C$ is slim if

z/({x} X X) ~ 

Let p" denote the set of inputs (x, /) G p' such that x is not slim. Then > 2~'=' • (1 - e)/2 - 2^"" > 

Our goal is to show that under the distribution $\nu$ every one-way protocol with the above properties
must have communication $\Omega(n)$. A one-way protocol partitions the inputs into one-way rectangles.
Let us consider a one-way rectangle $(R, d_R)$ such that there are $x \ne y$ with $x, y \in C \cap R$, and both
$x, y$ are not slim. Recall that $x$ and $y$ have Hamming distance at least $n/4$.

Lemma 4 If $x, y$ are both in the same one-way rectangle, then the error (restricted to rows that are
not slim) is

$$\frac{\nu((R \times d_R^{-1}(1)) \cap \mathrm{MajIx}^{-1}(0))}{\nu(R \times \mathcal{I})} \ge 2^{-2\sqrt{n}}.$$

Since the protocol accepts only a set of 0-inputs with size at most $\epsilon/2$ under $\nu$, and accepts all
inputs in $p''$, at most half of all inputs in $p''$ (under $\nu$) can be in one-way rectangles that contain at
least 2 different codewords as rows, else the error exceeds $2^{-a-3} \cdot 2^{-2\sqrt{n}} > \epsilon/2$.

This means that there must be $2^{\Omega(n)}$ one-way rectangles in the protocol, and hence the communication $c' \ge \Omega(n)$, which in turn implies $c + a \ge \Omega(\sqrt{n})$, finishing our proof.

□ 



Proof of Lemma 4. Consider a one-way rectangle $(R, d_R)$. $R$ contains at least two different
codewords $x \in C$ and $y \in C$ (that are both not slim). We are interested in the restrictions that this
places on $d_R$. We identify $x, y$ with subsets of $\{1, \ldots, n\}$. Then $|x \cap y| \le n(1/2 - 1/8)$ because $x$ and $y$
have weight $n/2$ and Hamming distance at least $n/4$. Let $S \subseteq \mathcal{I}$ be the set of $I$ such that $(x, I) \in p''$. Then for all
$I \in S$ we have that all of the $i \in I$ must satisfy $x_i = 1$.
First let $T \subseteq S$ be the set of all $I \in S$ such that $|I \cap x \cap y| \ge 0.8\sqrt{n}$. Then

$$|T| \le 2^{-\alpha\sqrt{n}} \cdot \binom{n/2}{\sqrt{n}}$$

for some constant $\alpha > 0$. Note that the binomial coefficient on the right hand side is just the number
of $I \in \mathcal{I}$ such that $(x, I)$ is a 1-input. Hence



$$\frac{\nu((\{x\} \times T) \cap p'')}{\sum_{I \in \mathcal{I} : \mathrm{MajIx}(x,I)=1} \nu(x, I)} \le 2^{-\alpha\sqrt{n}}.$$



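Since we can limit $a$ such that $2^{-2a} \ge 2 \cdot 2^{-\alpha\sqrt{n}}$, the set $T$ contributes little to the set of $I \in \mathcal{I}$ with
$(x, I) \in p''$:

$$\nu(\{(x, I) : (x, I) \in p'' \text{ and } I \notin T\}) \ge 2^{-2a-1} \cdot \sum_{I \in \mathcal{I} : \mathrm{MajIx}(x,I)=1} \nu(x, I) \ge 2^{-2a-2} \cdot \nu(\{x\} \times \mathcal{I}).$$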

So let us examine the set of all $I \in S$ such that $|I \cap x \cap y| < 0.8\sqrt{n}$. For all such $I$ we have either
$|x \cap I| < 0.9\sqrt{n}$ or $|y \cap I| < 0.9\sqrt{n}$, i.e., either $(x, I)$ or $(y, I)$ is a 0-input. Since we have assumed
that $(x, I) \in p''$, we get that $(y, I)$ is a 0-input. Now 0-inputs have a smaller probability each than
1-inputs, but on the other hand the allowed error $\epsilon$ is very small.

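We can now calculate the error of the one-way rectangle in the rows $x$ and $y$.

$$\nu(\{(y, I) : \mathrm{MajIx}(y, I) = 0 \text{ and } d_R(I) = 1\}) \ge \frac{\nu(\{(x, I) : I \in S - T\})}{3^{\sqrt{n}}} \ge \frac{2^{-2a-2}}{3^{\sqrt{n}}} \cdot \nu(\{x\} \times \mathcal{I}).$$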

We can play this game also with $x$ and $y$ exchanged and hence the two rows $x, y$ together have
substantial error. We can continue with more pairs of rows $x, y$ of the one-way rectangle, until only
one (or no) row is left. Hence the overall error is at least $2^{-2\sqrt{n}}$.

□ 



The proof technique used above can also be used to give a simple proof of a good lower bound for
the randomized one-way communication complexity of the Index function Ix. For an error parameter
$\epsilon$ choose a binary code with distance $\epsilon$ and size $2^{(1-H(\epsilon))n}$. The hard distribution is the uniform
distribution on the code times the uniform distribution on Bob's inputs. Whenever two codewords
$x, y$ are in the same message, the error in their rows together must be at least $\epsilon$. Hence most codewords
must be in separate messages.